Multiagent Deep Reinforcement Learning for Wireless-Powered UAV Networks
نویسندگان
چکیده
Unmanned aerial vehicles (UAVs) have attracted much attention lately and are being used in a multitude of applications. But the duration sky remains to be an issue due their energy limitation. In particular, this represents major challenge when UAVs as base stations (BSs) complement wireless network. Therefore, execute missions sky, it becomes beneficial wirelessly harvest from external adjustable flying sources (FESs) power onboard batteries avoid disrupting trajectories. For purpose, transfer (WPT) is seen promising charging technology keep flight allow them complete missions. work, we leverage multiagent deep reinforcement learning (MADRL) method optimize task between FESs UAVs. The optimization performed by carrying out three essential tasks: 1) maximizing sum-energy received all based on using WPT; 2) optimizing loading process ground BS; 3) computing most energy-efficient trajectories while duties. Furthermore, ensure high-level reliability transmission, use directional for both laser beams beam-forming technologies, respectively. study, simulation results show that proposed MADRL has efficiently optimized consumption FESs, which translates into significant gain compared baseline strategies.
منابع مشابه
Deep-Reinforcement Learning Multiple Access for Heterogeneous Wireless Networks
This paper investigates the use of deep reinforcement learning (DRL) in the design of a “universal” MAC protocol referred to as Deep-reinforcement Learning Multiple Access (DLMA). The design framework is partially inspired by the vision of DARPA SC2, a 3-year competition whereby competitors are to come up with a clean-slate design that “best share spectrum with any network(s), in any environmen...
متن کاملDeep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks
We consider a dynamic multichannel access problem, where multiple correlated channels follow an unknown joint Markov model. A user at each time slot selects a channel to transmit data and receives a reward based on the success or failure of the transmission. The objective is to find a policy that maximizes the expected long-term reward. The problem is formulated as a partially observable Markov...
متن کاملThroughput Maximization for UAV-Enabled Wireless Powered Communication Networks
This paper studies an unmanned aerial vehicle (UAV)-enabled wireless powered communication network (WPCN), in which a UAV is dispatched as a mobile access point (AP) to serve a set of ground users periodically. The UAV employs the radio frequency (RF) wireless power transfer (WPT) to charge the users in the downlink, and the users use the harvested RF energy to send independent information to t...
متن کاملMinimum Throughput Maximization in UAV-Aided Wireless Powered Communication Networks
This paper investigates unmanned aerial vehicle (UAV)-aided wireless powered communication network (WPCN) systems where a mobile access point (AP) at the UAV serves multiple energyconstrained ground terminals (GTs). Specifically, the UAVs first charge the GTs by transmitting the wireless energy transfer (WET) signals in the downlink. Then, by utilizing the harvested wireless energy from the UAV...
متن کاملMultiagent cooperation and competition with deep reinforcement learning
Evolution of cooperation and competition can appear when multiple adaptive agents share a biological, social, or technological niche. In the present work we study how cooperation and competition emerge between autonomous agents that learn by reinforcement while using only their raw visual input as the state representation. In particular, we extend the Deep Q-Learning framework to multiagent env...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Internet of Things Journal
سال: 2022
ISSN: ['2372-2541', '2327-4662']
DOI: https://doi.org/10.1109/jiot.2022.3150616